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^ (57) Abstract: Methods, computer program(s) and database($) to analyze and make use of gene haplotype information. These in- 
clude methods, program, and database to find and measure the frequency of haplotypes in the general population; methods, program, 
Q and database to find correlation's between an individual's haplotypes or genotypes and a clinical outcome; methods, program, and 
^ database to predict an individual's haplotypes from the individual's genotype for a gene; and methods, program, and database to 
^ predict an individual's clinical response to a treatment based on the individual's genotype or haplotype. 
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reference population is that described in U.S.Application Serial No. 60/198,340 
(inventors Stephens et al.), filed April 18, 2600, which is specifically incorporated 
by reference hferein. Another, less preferred embodiment for haplotyping the 
reference population, uses the CLASPER System™ technology (Ref. U.S. Patent 
5 Number 5,$66,404), which is a technique fbr direct haplotyping. Other examples of 
the techniques for direct haplotyping include single molecule dilution ("SMD") 
PCR (Ref. 9) and allele-specific PCR (Ref. 10). However, for the purpose of this 
invention, any technique for producing the haplotype information may be used. 

The information that is stored in a database, such as a 

10 

database associated with the DecoGen application exemplified herein includes (1) 
the positions of one or more, preferably two or more, most preferably all, of the sites 
in the gene locus (or other loci) that are variable (i.e. polymorphic) across members 
of the reference population and (2) the nucleotides found for each individuals' 2 

15 haplotypes at each of the polymorphic sites. Preferably, it also includes individual 
identifiers and ethnicity or other phenotypic characteristics of each individual. 

In the preferred embodiment of the invention, the haplotypes 
and their frequencies are stored and displayed, preferably in the manner shown, e.g., 

2Q in FIGURES 4A and 4B. Haplotypes and other information about each of the 
members of the population being analyzed can be shown, for example, in the 
manner shown in FIGURE 8. The information shown in FIGURE 8 includes a 
unique identifier (PID), ethnicity, age, gender, the 2 haplotypes seen for the 
individual, and values of all clinical measurements available for the individual. 

25 

Quantitative values of clinical measures would ordinarily be seen by scrolling to the 
right. However, for the subjects seen in this view, there is no clinical data. This is 
because this is the reference population of healthy individuals. 

The haplotype data may also be presented in the context of 

30 the entire DNA sequence. Examples of the sequences of the isogenes, with the 
polymorphisms highlighted, are shown in FIGURE 5. 

Because an individual has 2 copies of the gene (2 isogenes), 
and because these 2 copies are often different, some of the polymorphic sites will 

^ show 2 different nucleotides in a genotype, one from each of the isogenes. A 
genotype from an individual with haplotypes TAC and CAG would be 
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chromosomes in an individual. As used herein, genotype includes a full-genotype 
and/or a sub-genotype as described below. 

Genotyping - A process for determining a genotype of an 

individual 

Haplotype - A member of a polymorphic set, e.g., a 
sequence of nucleotides found at one or more of the polymorphic sites in a locus in 
a single chromosome of an individual. (See, e.g., HAP 1 in FIGURE 4A full 
haplotype is a member of a full polymorphic set). A sub-haplotype is a member of a 
polymorphic subset. 

Haplotype data - Information concerning one or more of the 
following for a specific gene: a listing of the haplotype pairs in each individual in a 
population; a listing of the different haplotypes in a population; frequency of each 
haplotype in that or other populations, and any known associations between one or 
15 more haplotypes and a trait. 

Haplotype pair - The two haplotypes found for a locus in a 

single individual. 

Haplotyping - A process for determining one or more 
20 haplotypes in an individual and includes use of family pedigrees, molecular 
techniques and/or statistical inference. 

Isoform - A particular form of a gene, mRNA, cDN A or the 
protein encoded thereby, distinguished from other forms by its particular sequence 
and/or structure. 

Isogene - One of the two copies (or isoforms) of a gene 
possessed by an individual or one of all the copies (or isoforms) of the gene found in 
a population. An isogene contains all of the polymorphisms present in the particular 
copy (or isoforms) of the gene. 
30 Isolated - As applied to a biological molecule such as RNA, 

DNA, oligonucleotide, or protein, isolated means the molecule is substantially free 
of other biological molecules such as nucleic acids, proteins, lipids, carbohydrates, 
or other material such as cellular debris and growth media. Generally, the term 
"isolated" is not intended to refer to a complete absence of such material or to 
absence of water, buffers, or salts, unless they are present in amounts that 
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